Instrumental Assessment of Near-end Perceived Listening Effort
نویسنده
چکیده
Communication in noisy situations may be extremely stressful for the person located at the near-end side. Since the background noise originates from a natural environment, it cannot be reduced for the listener. Thus, the only possibility to improve this scenario with support of digital signal processing is the insertion of speech enhancement algorithms in the downlink direction of terminals. So far no measurement technique is available to evaluate the impact of signal processing techniques such as “near-end listening enhancements” [1] (NELE), artificial bandwidth extension (BWE) or additional noise reduction (NR). For mobile phones, acoustic testing in downlink direction is always carried out in silent condition. However, in several state-of-the-art devices the aforementioned algorithms are already included. This implies that a device may behave differently under noisy conditions than in silence: e.g. NELE algorithms may be triggered by a certain noise level and/or spectrum. Whenever speech processing is inserted into a conversation, quality aspects must be considered, too. A satisfactory balance between speech quality and listening effort is desirable from the user’s point of view. Currently, no reliable objective or instrumental methods are available to evaluate speech quality and listening effort of a device under test (DUT) in downlink in the presence of background noise. Any possible metrics should take into account ongoing trends in acoustic telecommunication measurement standards, i.e.: • Usage of real speech instead of artifical test signals. • Realistic playback of background noise scenarios (e.g. according to [2] or [3]). • “Black-Box-Approach”: no internals of a DUT are known, only outer measurements are available. Due to these requirements, several existing assessment measures targeting to intelligibility and/or speech quality aspects prove to be unfavorable: • STITEL, STIPA, RASTI according to [4]: shaped noise signals are used for measurement. • ITU-T Recommendations P.862 [5], [6] and P.863 [7]: noise or near-end noise is explicitly excluded in scope. • ETSI EG 202 396-3 [8] and TS 103 106 [9]: methods are specified for noise reduction scenarios and only for uplink direction. Another widely used measure for the instrumental intelligibility assessment is the speech intelligibility index (SII) [10]. Several drawbacks of this measurement algorithm should be considered, too: • Pure 1/3 octave level-based measure, no real psychoacoustical model (except frequency weighting) • Noise-free degraded speech signal is needed as input (not available in acoustic testing) Figure 1: Recording setup for (binaural) signal assessment
منابع مشابه
Listening Pre-tasks in Motivational and Cognitive Strategies Instruction and Quality of Subjective Experience: EFL Learners’ Perspectives
EFL learners may advocate the desire to have a fulfilling experience while doing tasks rather than focus solely on finishing them. However, learners' perspectives have been virtually ignored in the classroom task implementation. Thus, the current study attempted to explore the perceptions of Iranian EFL learners towards listening pre-tasks in motivational and cognitive strategies instruction a...
متن کاملSingle-Ended Prediction of Listening Effort Based on Automatic Speech Recognition
A new, single-ended, i.e. reference-free measure for the prediction of perceived listening effort of noisy speech is presented. It is based on phoneme posterior probabilities (or posteriorgrams) obtained from a deep neural network of an automatic speech recognition system. Additive noisy or other distortions of speech tend to smear the posteriorgrams. The smearing is quantified by a performance...
متن کاملبررسی ارتباط درک ذهنی و فیزیولوژیکی سختی کار در کارگران یکی از صنایع فلزی اصفهان
Background: The importance of ergonomics is to create fit between work and human physiology using assessment methods of physical, physiological and subjective evaluation. The most commonly used tool for assessment of subjective symptoms is Borg scale during physical work and heart rate in physiological situations. This study is based on a subjective and physiological assessment during hard work...
متن کاملOn the identification of relevant degradation indicators in super wideband listening quality assessment models
Recently, new objective speech quality evaluation methods, designed and adapted to new high voice quality contexts, have been developed. One interest of these methods is that they integrate voice quality perceptual dimensions reflecting the effects of frequency-response distortions, discontinuities, noise and/or speech level deviations respectively. This makes it possible to use these methods a...
متن کاملDiagnostic Instrumental Speech Quality Assessment in a Super-Wideband Context
Speech quality models usually estimate the integral quality of the degraded speech files. Such quality values do not inform system developers and telephone service providers on the perceived degradation introduced by the system under study. This paper describes a new intrusive speech quality model, called Diagnostic Instrumental Assessment of Listening quality (DIAL), providing diagnostic infor...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017